Using lexical and corpus resources for augmenting the AAC-lexicon
نویسندگان
چکیده
A corpus of easy-to-read texts in combination with a base vocabulary pool for Swedish was used in order to build a basic vocabulary. The coverage of these entries by symbols in an existing AAC database was then assessed. We finally suggest a method for enriching the expressive power of the AAC language by combining existing symbols and in this way illustrate additional concepts.
منابع مشابه
Induction of Treebank-Aligned Lexical Resources
By ‘treebank-aligned lexical resources’ we mean ones where there is a systematic correspondence between the lexical resource and treebank syntactic resources. For instance, the lexicon resource contains features representing the subcategorization frames of verbs, which correspond to structural configurations that the verb occurs in, in a treebank. Given such an alignment, a treebank can be comp...
متن کاملBuilding lexical semantic representations for Natural Language instructions
We report on our work to automatically build a corpus of instructional text annotated with lexical semantics information. We have coupled the parser LCFLEX with a lexicon and ontology derived from two lexical resources, VerbNet for verbs and CoreLex for nouns. We discuss how we built our lexicon and ontology, and the parsing results we obtained.
متن کاملLERIL: Collaborative Effort for Creating Lexical Resources
The paper reports on efforts taken to create lexical resources pertaining to Indian languages, using the collaborative model. The lexical resources being developed are: (1) transfer lexicon and grammar from English to several Indian languages, and (2) dependency tree bank of annotated corpora for several Indian languages. The dependency trees are based on the Paninian model. (3) is an attempt t...
متن کاملAUTOLEX: An Automatic Lexicon Builder for Minority Languages Using an Open Corpus
The aim of this study is to build natural language resources for languages with limited resources or minority languages. Manually building these resources is tedious and costly. These natural language resources such as a language corpora and lexicon will be used for natural language processing research and system development. Tagalog, a minority language was considered in this study as a test b...
متن کاملEnhancing Speech Corpus Resources with Multiple Lexical Tag Layers
We describe a general two-stage procedure for re-using a custom corpus for spoken language system development involving a transformation from character-based markup to XML, and DSSSL stylesheet-driven XML markup enhancement with multiple lexical tag trees. The procedure was used to generate a fully tagged corpus; alternatively with greater economy of computing resources, it can be employed as a...
متن کامل